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Abstract 

We compare our results on empirical analysis of financial data with simula- 
tions of two stochastic models of the dynamics of stock market prices. The 
two models are (i) the truncated Levy flight recently introduced by us and 
(ii) the ARCH(l) and GARCH(1,1) processes. We find that the TLF well 
describes the scaling and its breakdown observed in empirical data, while 
it is not able to properly describe the fluctuations of volatility empirically 
detected. The ARCH(l) and GARCH(1,1) models are able to describe the 
probability density function of price changes at a given time horizon, but both 
fail to describe the scaling properties of the PDFs for short time horizons. 
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I. INTRODUCTION 



Inspired by pioneering works on the analysis and modeling of economic and financial 
systems [|l]-[|], a growing number of physicists are becoming involved in the analysis and 
modeling of financial markets [f§-|l!J. In this lecture we consider the price dynamics of a 
stock index traded in a financial market. The most accepted paradigm in finance is that 
no arbitrage is present in financial markets i.e. there is no way to extract money from the 
market continuously and without risk PD[ . 

In this lecture, we firstly recall results obtained by us JJ, IT |, by performing an empirical 
analysis of high-frequency data of one of the most important indices of the New York Stock 
Exchange, the Standard & Poor's 500 (S&P500) index. The results obtained in the empirical 
analysis are used as benchmarks for two stochastic processes used to model price dynamics 
in financial markets. The first model is the truncated Levy flight (TLF), recently introduced 
by us [plj. The second stochastic process belongs to the class of autoregressive conditional 



heteroscedasticity (ARCH) models and to its generalization (GARCH) [p3|l . We address 
strengths and weaknesses of all three models in describing real financial data. We focus 
our attention on the probability density functions (PDFs) of price changes at different time 
horizons, on the scaling properties of the PDFs and on the degree of stationarity of index 
changes. 



II. EMPIRICAL ANALYSIS 



We performed empirical analyses of the dynamics of indices of stock prices traded in 
financial markets HJ9|JTT||. Our empirical analysis fl,|TT| of the S&P500 Index of the New 
York Stock Exchange shows that a non-Gaussian scaling of the PDF of price changes is 
present at short times (from At = 1 to At = 1000 trading minutes) while a breakdown from 
the non-Gaussian scaling is present for long times (At » 1000 trading minutes) [§,0]- We 
performed our analysis by analyzing high frequency data recorded during the 6-year period 
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1/84-12/89 (time intervals between successive records as short as 15 seconds are present in 
the data base). In our analysis ||, we define the trading time as a continuous time starting 
from the opening of the day until the closing, and then continuing with the opening of the 
next trading day. From this data base, we select the complete set of non-overlapping records 
separated by a time interval At±eAt (where e is the tolerance, always less than 0.035). We 
denote the value of the S & P 500 as y(t), and we define z(t) = y(t) — y(t — At). In our 
analysis, we determine || the probability distribution P(z) of index variations for different 
values of At. We select At values that are logarithmically equally spaced ranging from 1 
to 1000 min. The number of data in each set is decreasing from the maximum value of 
493,545 {At = 1 min) to the minimum value of 562 (At = 1000 min). We note that the 
distributions are non-Gaussian, indeed, they have wings larger than expected for a normal 
process. 

We study the "probability of return to the origin" P(z = 0) as a function of At. With this 
choice, we are investigating the point of each probability distribution that is least affected 
by the noise introduced by the finiteness of the experimental data set. Our investigation 
of .P(O) versus At in a log-log plot |5| shows that the data are well-fit by a straight line 
characterized by the slope —0.712 ±0.025. We observe a non-normal scaling behavior (slope 
7^ —0.5) in an interval of trading time ranging from 1 to 1000 min. 

For short time horizons (from At = 1 to At = 1000 minutes), this empirical finding 
agrees with the model of a Levy flight proposed by Mandelbrot in 1963 to model cotton 
price dynamics M or with the model of a Levy walk ||24j| . In fact, if the central region of the 



distribution is well-described by a Levy stable symmetrical distribution [25 



1 f°° 

L a (z,At) = — / exp(— 'jAtq a ) cos(qz)dq, (1) 
7T Jo 

of index a and scale factor 7 at At = 1 , then the probability of return is given by 

By using the value —0.712 from the analysis of the probability of return, we obtain the index 
a = 1.40 ±0.05 [|. 



We also check if the scaling extends over the entire probability distribution as well as 
z — 0. All the distributions (with At = 1 to At = 1000 minutes) agree well with a Levy 
stable distribution P,p6f. The distributions obtained with the highest temporal resolution 



(At < 10) show that in addition to the good agreement with the Levy (non-Gaussian) 
profile observed for almost three orders of magnitude, an approximately exponential fall-off 
is present. The clear deviation of the tails of the distribution from the Levy profile shows 
that the experimental tails are less fat than expected for a Levy distribution. 

The Levy distribution has an infinite second moment (if a < 2) ||25|| . However, our 
empirical finding of an exponential (or stretched exponential) fall-off implies that the second 
moment is finite. This conclusion might at first sight seem to contradict our observation of 
Levy scaling of the central part of the price difference distribution over fully three orders of 
magnitude. However, the contradiction is more apparent than real since, for example, the 
above findings are consistent with the theoretical predictions of the truncated Levy flight 

Ea- 
rn. THE TRUNCATED LEVY FLIGHT 

The truncated Levy flight (TLF) has been introduced by Mantegna and Stanley in 
Ref. IfnU . A TLF is defined as a stochastic process {x} characterized by the following 
probability density function 

{0 x > £ 

ciL(x) -t < x < t , (3) 
x <-£ 

where L(x) is the symmetrical Levy stable distribution of index a (0 < a < 2) and scale 
factor 7 (7 > 0), Ci is a normalizing constant and i is the cutoff length. In the following 
theoretical considerations, for the sake of simplicity, we set 7 = 1. 

The central limit theorem (CLT) is fundamental to statistical mechanics. It states that 
when n — > 00, the sum 
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Z n = ^Xi (4) 
i=l 

of n stochastic variables {x} that are statistically independent, identically distributed and 
with a finite variance, converges to a normal (Gaussian) stochastic process. Generally, 
n pd 10 is sufficient to ensure convergence. In a dynamical system, Eq. (|j) defines a random 
walk if the variable x is the jump size performed after a time interval At and n is the 
number of time intervals. Here, the "number of variables" n and the "time" t = nAt can 
be interchanged everywhere. 

For low values of n, P(z n = 0) takes a value very close to the one expected for a Levy 
stable process 

P(z n = 0)^L(z n = 0) = ^^. (5) 



nan 



For large values of n, P(z n = 0) assumes the value predicted for a normal process, 

P(z n = 0) ~ N(z n = 0) = -= — -, (6) 



where a (a,£) is the standard deviation of the TLF stochastic process {x}. 

In the interval 1 < a < 2, the crossover between the two regimes has been determined 



in Ref. |2| as: 

n x «if, (7) 

where A is a function of a (the explicit form is given in Ref. |]2]]])- The description of the 
convergence process does not depend crucially on the exact shape of the cut-off |27] and some 
results of Ref. |2l| have been confirmed analytically for an exponential cut-off in Ref. |28 |. 



By performing numerical simulations, we verified J2l| that the probability of return to 



the origin indicates with high accuracy the degree of convergence of the process to one of 
the two asymptotic regimes. 

The TLF model explains the empirical observations of (i) non-Gaussian scaling of the 
PDFs of price changes for short times; (ii) Levy shape of the central part of the price 



change distributions for At < 1000 trading minutes; (iii) gradual convergence to a Gaussian 
process for long time horizons (At » 1000 trading minutes). However, not all the features 
observed in the S&P 500 dynamics are described by the TLF model. The simplest version 
of the model cannot describe the short time memory (of the order of 20 minutes or less) 
observed in the empirical data [TT| , |2"6[ and also does not explain the empirical observation of 
the time dependence of the parameter 7 which is fluctuating with burst of activity localized 
in specific months f^fj. The 7 parameter is related to what is called "volatility" in the 
economic literature |29| . 



IV. ARCH PROCESS 



ARCH stochastic models were introduced by Engle in 1982 They are stochastic 

models with autoregressive conditional heteroscedasticity, namely zero mean, uncorrelated 
stochastic processes with nonconstant variances conditional on the past. These models have 
a very interesting property: they might be locally unstationary (for short time intervals) but 
globally stationary for well defined ranges of the values of the control parameters. They are 
widely known in the economic literature []30 |, but they are almost unknown to the physics 



community in spite of the fact that they might also be useful in the description of physical 
problems. 

The simplest ARCH model is the ARCH(l) model defined as a random variable Z which 
is characterized at time t by a variance of given by 

of = a + OL\Z 2 t _ x (8) 

where Z t _i is a random variable selected from a set of random variables characterized by a 
Gaussian distribution with zero mean and standar deviation <Jt-i- «o and a\ are the control 
parameters of the stochastic process. 

The most general ARCH stochastic process, the ARCH(n) process is defined by 
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2 = a + a x Zl_ x + + a n Z 2 t _ n (9) 



where ao, a„ are control parameters and Z t -i, Z t - n are random variables drawn from 
sets of random variables with Gaussian distributions of zero mean and standard deviations 
0t_i, <Jt-n respectively. In spite of the fact that 07 is showing an intermittent-like behav- 
ior, the overall process {Z} is stationary on a long time scale for a wide range of the control 
parameters. For example, it has been proven by Engle that the ARCH(l) process has finite 
variance for a± < 1, and finite fourth moment for 3a 2 < 1 |^2| . 



We simulate several ARCH(l) process to investigate the dynamics of the unconditional 
probability density function P(Z n At) at different time horizons nAt [Z n ^ t = Y^l=\Zt-i)- 
For each simulation we also study the scaling properties of the "probability of return to the 
origin" P{Z n ^ t = 0) as a function of nAt. We select the values of the control parameters to 
investigate ARCH(l) processes which are characterized by the same unconditional variance 
observed in our empirical investigation of the S&P500 dynamics (namely a 2 = 2.57 • 10~ 3 ) 
and by different values of the kurtosis k of the {Z} process. For an ARCH(l) process the 
unconditional variance is given by |22| 



a 2 = -25- (10) 
1 — Cti 



while the kurtosis is P2~ 



* = ^$ (id 

1 — 3af 

We focus our attention on three cases: 

(i) ao = 0.00231 and a\ = 0.1 . In this case a 2 = 2.57 • 10~ 3 and k = 3.06. The value of 
k is very close to the one expected for a Gaussian stochastic process (k = 3); 

(ii) «o = 0.00112 and a± = 0.564. With these values of the control parameters the 
variance and the kurtosis are a 2 = 2.57 ■ 10~ 3 and k = 43. This value of k is approximately 
the same value observed in the empirical analysis of the S&P500 changes for time intervals 
At = 1 minute; 

(iii) a = 0.00109 and «i = 0.575 . Values of the control parameters implying the same 
variance as above but a very high value for the kurtosis (k = 247). 



By varying the values of the control parameters, it is of course possible to make the shape 
of the PDF P(Z) more leptokurtic than a Gaussian distribution. The presence of a given 
degree of leptokurtosis does not imply directly scaling properties of P(Z n & t ) PDFs strongly 
different from the Gaussian scaling. By studying the "probability of return to the origin", 
we find that an approximate scaling behavior is present in ARCH(l) stochastic processes for 
short times (nAt < 100). We find that the values of the scaling exponent best describing the 
above-cited time evolution of P(Z n & t = 0) are 2.02, 1.93 and 1.85 respectively. These values 
are very close to the scaling exponent 2 observed for a Gaussian stochastic process. Hence 
an ARCH(l) process is not able to describe the scaling properties empirically observed in 
the stochastic dynamics of the S&P500 for At < 1000 minutes (where the scaling exponent 
is 1.4). 



V. GARCH PROCESS 

ARCH(l) model is the simplest autoregressive model. In the following, we will consider 
a less simple autoregressive model, the GARCH(1,1) model. The GARCH(1,1) model is 
widely studied in the economic literature ppf . In 1986 generalized ARCH or GARCH(p,q) 
models were proposed P3"| . These models are more flexible than ARCH models in the lag 
structure. They are defined by the relation 

a 2 t =a + ctxZ>t-\ + + UpZt-p + Pi°t-i + + M-q ( 12 ) 

where the constants a , a p , Pi, f3 q are the control parameters of the GARCH stochastic 
process. The simplest GARCH process, the GARCH(1,1), is often studied in the modeling 
of prices of financial assets. GARCH(1,1) processes are unconditional stationary with finite 
variance and fourth moment if 1 — a.i — fi\ > and 1 — j3f — 2ai(3\ — 3af > respectively. 
Empirical analyses of stock market price data have shown (see for example |H]]) that a good 
choice of the parameter (3\ is (3\ = 0.9 . Accordingly, we set (3\ = 0.9 in our simulations 
and we set the remaining control parameters «o and ot\ to the values «o = 2.3 10~ 5 and 



cti = 0.09105. With this choice of control parameters, the unconditional variance [32 



of the process {Z} is approximately equal to the value observed in the S&P500 data. The 



kurtosis 32 
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K = i _ ^ _ 2^ - 3a? + 3 (M) 
also assumes the value measured in the empirical analysis of the S&P500 data (k = 43). 
The simulated GARCH(1,1) process has an unconditional PDF P(Z& t ) which mimics very 
well the one observed in the S&P500 data with a time interval At = 1 minute |J. We also 
study the probability of return to the origin to determine if scaling is observed for the PDFs 
of this process. We observe a scaling behavior for a wide range of time (nAt < 10,000). 
The measured scaling exponent is 1.88, a value close to the Gaussian scaling exponent and 
rather different from the scaling exponent found in empirical data (a = 1.4). 

In summary, the GARCH(1,1) process fails to properly describe the scaling properties of 
the S&P500 index detected for At < 1000 minutes. However, GARCH(1,1) is able to give 
an accurate description of the At = 1 minute PDF using as control paremeters (3\ — 0.9 and 
obtaining the values of «o and ot\ from the values of a 2 and k measured from the empirical 
data. 



VI. DISCUSSION 

Our study shows that the problem of the complete stochastic characterization of index 
(or price) dynamics in a financial market is an open question. For example, both models 
considered here have strengths and limitations, and modifications of them are needed to 
reach a more satisfactory agreement with the results of empirical analyses. The problem 
of stochastic modeling of price dynamics comprises fundamental and applied aspects. The 
fundamental aspects are related to the theoretical modeling of a nonlinear complex system 
evolving without known conservation laws in the presence of quenched and external noise. 
The applied aspects are related to the role that the exact shape of the PDF of stock returns 



and the time evolution of the variance of stock returns (volatility in the economic literature 
p9fl), plays in the pricing of derivative financial products ||33|| . An extremely important 
activity performed everyday in financial markets. 

We think that the mixing of empirical analyses, modeling, simulations and comparison 
between empirical data and simulations constitutes a scientific procedure that will allow us 
to eventually find the most accurate and "parsimonious" stochastic model describing index 
(or price) dynamics. 
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